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Abstract 

We give a general local central limit theorem for the sum of two indepen- 
dent random variables, one of which satisfies a central limit theorem while 
the other satisfies a local central limit theorem with the same order variance. 
We apply this result to various quantities arising in stochastic geometry, in- 
cluding: size of the largest component for percolation on a box; number of 
components, number of edges, or number of isolated points, for random ge- 
ometric graphs; covered volume for germ-grain coverage models; number of 
accepted points for finite-input random sequential adsorption; sum of nearest- 
neighbour distances for a random sample from a continuous multidimensional 
distribution. 
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1 Introduction 

A number of general central limit theorems (CLTs) have been proved recently for 
quantities arising in stochastic geometry subject to a certain local dependence. See 
[18, 19, 20, 21, 22] for some examples. The present work is concerned with local cen- 
tral limit theorems for such quantities. The local CLT for a binomial (n, p) variable 
says that for large n with p fixed, its probability mass function minus that of the 
corresponding normal variable rounded to the nearest integer, is uniformly o(n -1 / 2 ). 
The classical local CLT provides similar results for sums of i.i.d. variables with an 
arbitrary distribution possessing a finite second moment. Here we are concerned 
with sums of variables with some weak dependence, in the sense that the summands 
can be thought of as contributions from spatial regions with only local interactions 
between different regions. 
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Among the examples for which we obtain local CLTs here are the following. 
In Section 3 we give local CLTs for the number of clusters in percolation on a 
large finite lattice box, and for the size of the largest open cluster for supercritical 
percolation on a large finite box, as the box size becomes large. In Sections 4 and 
5 we consider continuum models, starting with random geometric graphs [18] for 
which we demonstrate local CLTs for the number of copies of a fixed subgraph (for 
example the number of edges) both in the thermodynamic limit (in which the mean 
degree is 0(1)) and in the sparse limit (in which the mean degree vanishes). For 
the thermodynamic limit we also derive local CLTs for the number of components 
of a given type (for example the number of isolated points), as an example of a 
more general local CLT for functionals which have finite range interactions or which 
are sums of functions determined by nearest neighbours (Theorem 5.1). This also 
yields local CLTs for quantities associated with a variety of other models, including 
germ-grain models and random sequential adsorption in the continuum. 

We derive these local CLTs using the following idea which has been seen (in 
somewhat different form) in [8], in [4], and no doubt elsewhere. If the random 
variable of interest is known to satisfy a CLT, and can be decomposed (with high 
probability) as the sum of two independent parts, one of which satisfies a local CLT 
with the same order of variance growth, then one can find a local CLT for the original 
variable. Theorem 2.1 below formalises this idea. The statement of this result has 
no geometrical content and it could be of use elsewhere. 

In the geometrical context, one can often use the geometrical structure to effect 
such a decomposition. Loosely speaking, in these examples one can represent a 
positive proportion of the spatial region under consideration as a union of disjoint 
boxes or balls, in such a way that with high probability a non-vanishing proportion 
of the boxes are 'good' in some sense, where the contributions to the variable of 
interest from a good box, given the configuration outside the box and given that it 
has the 'good' property, are i.i.d. Then the classical local CLT applies to the total 
contribution from good boxes, and one can represent the variable of interest as the 
sum of two independent contributions, one of which (namely the contribution from 
good boxes) satisfies a local CLT, and then apply Theorem 2.1. This technique 
is related to a method used by Avram and Bertsimas [1] to find lower bounds on 
the variance for certain quantities in stochastic geometry, although the examples 
considered here are mostly different from those considered in [1]. 

In any case, our results provide extra information on the CLT behaviour for 
variables for numerous geometrical and multivariate stochastic settings, which have 
arisen in a variety of applications (see the examples in Section 5). 
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2 A general local CLT 



In the sequel we let denote the standard (A/"(0, 1)) normal density function, i.e. 
4>(x) = (27r) -1 / 2 exp(— (l/2)x 2 ). Note that for a > 0, the probability density func- 
tion of the Af(0,cr 2 ) distribution is then a'^x/a), x G R. Define the Af(0, 0) 
distribution to be that of a random variable that is identically zero. 

We say a random variable X is integrable if E [\X\] < oo. We say X has a lattice 
distribution if there exists h > such that (X — a)/h G Z almost surely for some 
a G R. If X is lattice, then the largest such h is called the span of X, and here 
denoted h X - If X is non-lattice, then we set h x ■— 0. If X is degenerate, i.e. if 
Var[X] = 0, then we set hx '■= +oo. As usual with local central limit theorems, 
we need to distinguish between the lattice and non-lattice cases. For real numbers 
a > 0, b > 0, we shall write a\b to mean that either b is an integer multiple of a or 
a = 0. When a = +oo, b < oo we shall say by convention that a\b does not hold. 

Theorem 2.1. Let V, V\, V2, V3, . . . fre independent identically distributed random 
variables. Suppose for each n G N £/iai (F n , S^, Z n ) zs a triple of integrable random 
variables on the same sample space such that (i) Y n and S n are independent, with 

Sn = E?=i Vj; (™) b oth n^E [\Z n - (Y n + S n )\] and n l ' 2 P[Z n ^ Y n + S n ] tend to 
zero as n — )• 00; and (raj /or some G [0, 00), 

n~ 1/2 (Z n -EZ n ) ^AT(0,o 2 ) as n^oo. (2.1) 

Then Var[V] < a 2 and if b, c±, C2, C3, . . . are positive constants with hy\b and c n ~ 
n 1 / 2 as n — )> oo, iaen 



sup < c„P[Z n G [u,u + 6)] - o _1 60 I 



->■ as n ->■ 00. (2.2) 



Also, 



n 



-V2(y n _ E y n ) _^jV-(0,<r 2 -Vax[V]). ( 2 . 3 ) 



Remarks. The main case to consider is c n = n 1 ^ 2 . The more general formulation 
above is convenient in some applications, e.g., in the proof of Theorem 4.1. Theorem 
2.1 is proved in Section 7. Our main interest is in the conclusion (2.2), but (2.3), 
which comes out for free from the proof, is also of interest. 

3 Percolation 

Most of our applications of Theorem 2.1 will be in the continuum, but we start 
with applications to percolation on the lattice. We consider site percolation with 
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parameter p, where each site (element) of Z d is open with probability p and closed 
otherwise, independently of all the other sites. Given a finite set B C Z d , the open 
clusters in B are defined to be the components of the (random) graph with vertex 
set consisting of the open sites in B, and edges between each pair of open sites 
in B that are at unit Euclidean distance from each other. Let A(B) denote the 
number of open clusters in B. Listing the open clusters in B as Ci, . . . , Ca(b), and 
denoting by |Cj| the order (i.e., the number of vertices) of the cluster Cj, we denote 
by L(B) the random variable max(|Ci|, . . . , |Ca(b)|), and refer to this as the size 
of the largest open cluster in B. Given a growing sequence of regions (B n ) n >i in 
Z d , we shall demonstrate local CLTs for the random variables A(B n ) and L(B n ), 
subject to some conditions on the sets B n which are satisfied, for example, if they 
are cubes of side n. There should not be any difficulty adapting these results to 
bond percolation. 

For B C Z d let \B\ denote the number of elements of B. Let \dB\ denote the 
number of elements oiZ d \B lying at unit Euclidean distance from some element of 
B. We say a sequence (B n ) n >i of nonempty finite sets in Z d has vanishing relative 
boundary if 

lim \dB n \/\B n \ = 0. (3.1) 

n— >oo 

We write liminf(S n ) for U n >i H m >„ B m . 

Theorem 3.1. Suppose d > 2 and p E (0, 1). Then there exists a > such that 
if (B n ) n >i is any sequence of nonempty finite subsets in Z d with vanishing relative 
boundary and with liminf(_B„) = Z d , then 

\B n \-^ 2 (A(B n ) -EA(B n )) -^Af(0,a 2 ) (3.2) 



and 



sup 



\Bn\ 1/2 P[A(B n )=j]-a-^(t_^ 



->■ 0. (3.3) 



For the size of the largest open cluster we consider a more restricted class of 
sequences (B n ) n >i. Let us say that (B n ) n >i is a cube-like sequence of lattice boxes 
if each set B n is of the form IIj=i([ — a i,«? ^j,n] H Z), where a,j >n E N and b^ n E N for 
all j, n, and moreover 

inf {ai jTl , 6i, n , a 2 ,ni ^2,nj • • • j a d,ni bd,n} n fn a\ 

hmmf — ■ — — ■ — r- > (3.4) 

n^oo sup{ai jn , b 1>n , a 2 , n , b 2 ,n, • • • , a d>n , b dtn } 

which says, loosely speaking, that the sets B n are not too far away from all being 
cubes. 
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Given d > 2, and p G (0, 1), let 9d(p) denote the percolation probability, that is, 
the probability that the graph with vertices consisting of all open sites in 7L d and 
edges between any two open sites that are unit Euclidean distance apart includes 
an infinite component containing the origin. Let p c (d) denote the critical value of 
p for site percolation in d dimensions, i.e., the infimum of all p G (0, 1) such that 
9 d {p) > 0. It is well known that p c (d) G (0, 1) for all d > 2. 

Theorem 3.2. Suppose d>2 and p G (p c (d), 1). Then there exists a > such that 



if (B n ) n >i is any cube-like sequence of lattice boxes Z d with liminf(£> n ) 
have 



Z d , we 



B n \- 1/2 (L(B n ) -EL(B n )) ^ Af(0,a 2 ) 



v 



(3.5) 



and 



sup 



\B n \ 1/2 P[L(B n )=j]-a- 1 ( f ) 



j — E L(B n ) 
o-\B n \^ 



->■ 0. 



(3.6) 



Theorems 3.1 and 3.2 are proved in Section 8. Theorem 3.1 is the simplest of 
our applications of Theorem 2.1 and we give its proof with some extra detail for 
instructional purposes. 



4 Random geometric graphs 

For our results in this section and the next, on continuum stochastic geometry, let 
Xi,X 2 ,... be i.i.d. <i-dimensional random vectors with common density /. As- 
sume throughout that / max := sup x6R d /(x) < oo, and that / is almost everywhere 
continuous. Define the induced binomial point processes 

X n ■= Xntf) ■= {X u ...,X n }, neN. (4.1) 

In the special case where / is the density of the uniform distribution on the unit 
[0, l] d cube we write / = fjj. 

For locally finite X C M. d and r > 0, let Q(X, r) denote the graph with vertex set 
X and with edges connecting each pair of vertices x,y in X with \y — x\ < r; here 
| • | denotes the Euclidean norm though there should not be any difficulty extending 
our results to other norms. Sometimes Q(X, r) is called a geometric graph or Gilbert 
graph. 

Let (r n ) n >i be a sequence with r n — > as n — > oo. Graphs of the type of Q(X n , r n ) 
are the subject of the monograph [18]. Among the quantities of interest associated 
with G(X n , r n ) are the number of edges, the number of triangles, and so on; also the 
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number of isolated points, the number of isolated edges, and so on. CLTs for such 
quantities are given in Chapter 3 of [18] (see the notes therein for other references) 
for a large class of limiting regimes for r n . Here we give some associated local CLTs. 

Let k G N and let T be a fixed connected graph with k vertices. We follow 
terminology in [18]. With ~ denoting graph isomorphism, let G n be the number of 
/t-subsets y of X n such that Q(y,r n ) ~ T (i.e., the number of induced subgraphs of 
Q(X n ,r n ) that are isomorphic to T). Let G* (denoted J n in [18]) denote the number 
of components of Q(X n ,r n ) that are isomorphic to T. To avoid certain trivialities, 
assume that T is feasible in the sense of [18], i.e. that Q(X K ,r) is isomorphic to T 
with strictly positive probability for some r > 0. When considering G n , we shall 
also assume that k > 2. We shall give local CLTs for G n and G* n . 

We assume existence of the limit 



p := lim (nr*) < oo, (4.2) 



so that p could be zero. If p > then we are taking the thermodynamic limit. 
We also assume that 

t 2 := ninr^Y' 1 oo as n ->■ oo. (4.3) 

Then (see Theorems 3.12 and 3.13 of [18]) there exists a constant a = a(f, T, p) > 0, 
given explicitly in terms of /, Y and p in [18], such that 

lim r- 2 Var(G n ) = a 2 ; (4.4) 

n— >oo 

T -\G n -EG n )^N(0,a 2 ). (4.5) 
We prove here an associated local central limit theorem for the case / = /[/. 

Theorem 4.1. Suppose f = fu- Suppose k > 2, and suppose assumptions (4.2) 
and (4.3) hold. Then as n — >■ oo ; 



sup 

j el 



r n P[G n = j\-o- 



->■ 0. (4.6) 



We prove Theorem 4.1 in Section 9. It should be possible to obtain similar 
results for G*, but we shall do so only for the thermodynamic limit with p > 0, as 
an example in the next section. In the next section we shall see that for the case 
with p > 0, it is possible to relax the assumption that f = fu in Theorem 4.1; 
when p = 0, a similar extension to non-uniform densities should be possible, but we 
content ourselves here with the case f = fu so as to provide one example where the 
simplicity and the appeal of the approach do not get buried. 
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5 General local CLTs in stochastic geometry 



In this section we present some general local central limit theorems in stochastic 
geometry. We shall illustrate these by some examples in the next section. 

For our general local CLTs in stochastic geometry, we consider marked point sets 
in R d . Let M. be an arbitrary measurable space (the mark space), and let F M be a 
probability distribution on M.. Given x = (x, t) £ R d x Ai and given y £ R d , set 
y + x := (y + x,t). Given also a £ R, set ax = (ax,t). We think of t as a mark 
attached to the point x £ R d that is unaffected by translation or scalar multiplaction. 
Given X* C R d x M,y £ R d , and a £ (0,oo), let y + a** := {y + ax : x £ X*}. Let 
denote the origin of R d . For x £ R d , and r > 0, let £( x; r) denote the Euclidean 
ball {y £ R d : \y - x\ < r}, and set B*(x; r) := B(x; r) x M. Set S(r) := B(0; r) 
and _B*(r) := B*(0; r). Given nonempty A 1 * C l rf x M and y* cR d x M, write 

D(;t*,y) := mi{\x-y\ : G ^*,(?/,u) £ y* for some t,u £ M}. 

Let 0;^ denote the volume of the rf-dimensional unit ball B(l). 

Suppose H(X*) is a measurable M-valued function defined for all finite X* C 
M d x At. Suppose if is translation invariant, i.e. H(y + X*) = H(X*) for all y £ R d 
and all X*. 

Throughout this section we consider the thermodynamic limit; let r n ,n > 1 be 
a sequence of constants such that (4.2) holds with p > 0. Define 

H n (X*)'.= H(r- x X*). (5.1) 

Let the point process X n := {X ± , . . . ,X n } in R d be as given in (4.1), with / as in 
Section 4 (so / max < oo and / is Lebesgue-almost everywhere continuous). Define 
the corresponding marked point processs (i.e., point process in R d x M) by 

<^n '■= {(Xi, Ti), . . . , (X n , T n )}, 

where (T 1; T 2 , T 3 , . . .) is a sequence of independent A^-valued random variables with 
distribution P^, independent of everything else. We are interested in local CLTs 
for H n (X*), for general functions H. We give two distinct types of condition on H, 
either of which is sufficient to obtain a local CLT. 

We shall say that H has finite range interactions if there exists a constant r £ 
(0, oo) such that 

H(X* U y*) = H(X*) + H(y*) whenever D(X*,y*) > r. (5.2) 

In many examples it is natural to write H(X*) as a sum. Suppose £(x; X*) is a 
measurable R- valued function defined for all pairs (x, X*), where X* C R d x M is 
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finite and x is an element of X* . Suppose £ is translation invariant, i.e. £(y + x; y + 
X*) = £(x; <¥*) for all y G-W 1 and all x, A"*. Then £ induces a translation-invariant 
functional defined on finite point sets X * C R d x M by 

tf«>(**):= ^(x;**). (5.3) 

Given r G (0, oo) we say £ has range r if £((z, t); X*) = £((x,t);X* n B*(x)) for all 
finite A"* C K d x .M and all (x,t) G A"*. It is easy to see that if £ has range r for 
some (finite) r then has finite range interactions, although not all H with finite 
range interactions arise in this way. 

Let kGN. Given any set X* G M. d x Ai with more than k elements, and given 
x = (x, t) G X*, set -R K (x; X*) to be the K-nearest neighbour distance from x to X*, 
i.e. the smallest r > such that X* fl B*(x; r) has at least k elements other than x 
itself. If X* has k or fewer elements, set i? K (x; A"*) := oo. 

We say that £ depends only on the k nearest neighbours if for all x and X* , 
writing x = (x,t) we have 

£(x;A'*) = £(x;A'*n J B*( a :; J R K (x;A'))). 

We give local CLTs for H under two alternative sets of conditions: either (i) when H 
has finite range interactions, or (ii) when H is induced, according to the definition 
(5.3), by a functional £(x; X*) which depends only on the k nearest neighbours, for 
some fixed k. 

Given K > and n G N, define point processes U n; K, and Z n in M. d , and point 
processes U* K , and 2* in x A^, as follows. Let U n> K denote the point process 
consisting of n independent uniform random points U\,k-, • • • , U Ht x in B(K), and let 
Z n be the point process consisting of n independent points Z\,...,Z n in M, d , each 
with a rf-dimensional standard normal distribution (any other positive continuous 
density on M, d would do just as well). The corresponding marked point processs are 
defined by 

K,k ■■={(Ui,K,T 1 ),...,(U n>K ,T n )}; 
Z* n :={(Z 1 ,T 1 ),...,(Z n ,T n )}. 

Define the limiting span 

h(H):=hmm£h H{Zi) . (5.4) 

n— >oo 

Theorem 5.1. Suppose that either (i) H has finite range interactions and hn(z*) < 
oo for some jigN, or (ii) for some k G N, H is induced by a functional £(x; X*) 
which depends only on the k nearest neighbours, and hn(z*) < oo for some n G N 
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with n > k. Suppose also that H n (X*) and H(U* K ) are integrable for all n G N and 
K > 0. Finally suppose that 



n 



-1/2 



(H n (X*) — EH n (X*)) A^(0 )( t 2 ) asn^oo. (5.5) 



Then a > and h(H) < oo and for any b G (0, oo), with h{H)\b, 



sup 



n^P[H n (K) e[u,u + b)] - a-^ ( U 



— >■ as n — > oo. 

(5.6) 



We prove Theorem 5.1 in Section 10. Analogues to this result and to Theorem 
4.1 should also hold if one Poissonizes the number of points in the sample, but we 
do not give details. 

The corresponding result for unmarked point sets in M. d goes as follows; we adapt 
our terminology to this case in an obvious manner. 

Corollary 5.1. Suppose H(X) is M.-valued and defined for all finite X C M. d . Sup- 
pose H is translation invariant, and set H n (X) := H^^X). Suppose that either (i) 
H has finite range interactions and h HtyZn ) < oo for some n G N, or (ii) for some 
k G N 7 H is induced by a functional £,(x; X) which depends only on the k nearest 
neighbours, and hu{z n ) < oo for some n G N with n > k. Suppose also that H n (X n ) 
and H(lA n ^K) (ire integrable for all n G N and K > 0. Finally suppose 

n- 1/2 (H n (X n ) -EH n (X n )) A A/"(0 )( t 2 ) as n -> oo. (5.7) 
Then a > and h(H) < oo and for any b G (0, oo), with h(H)\b, 



sup 



n^P[H n {X n ) e[u,u + b)] - a-fy ( - 



— > as n — > oo. 

(5.8) 



Corollary 5.1 is easily obtained from Theorem 5.1 by taking M. to have just a 
single element, denoted t say, and identifying each element (x,t ) eK^xM with 
the corresponding element x of M. d . 

To apply Theorem 5.1 in examples, we need to check condition (5.5). For some 
examples this is best done directly. However, if we strengthen the other hypotheses 
of Theorem 5.1, we can obtain (5.5) from known results and so do not need to 
include it as an extra hypothesis. The next three theorems illustrate this. As well 
(5.5), these results give us the associated variance convergence result 

lim n^Yai[H n {X*)} = a 2 . (5.9) 
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In the next three theorems, we impose some extra assumptions besides those of 
Theorem 5.1. Writing supp(/) for the support of /, we shall assume that supp(/) 
is compact, and that also r n satisfy 



which implies (4.2) with p — 1. We also assume certain polynomial growth bounds; 
see (5.11), (5.13) and (5.14) below. 

First consider the case where H = is induced by a functional £(x; X*) with 
finite range r > 0. For any set A, let card(A) denotes the number of elements of A. 

Theorem 5.2. Suppose H = H^> is induced by a translation invariant functional 
£(x; X*) having finite range r and and satisfying for some 7 > the polynomial 
growth bound 



\£((x, t); X*)\ < 7 (caxd(#* n B*(x;r))f V finite X* C W x M, V (x,t) G X* . 



Suppose hn(z*) < 00 for some n G N, and suppose supp(/) is compact. Finally, 
suppose that (5.10) holds. Then there exists a G (0, 00) such that (5.5) and (5.9) 
hold, and h(H) < 00 and (5.6) holds for all b with h(H)\b. 

Now we turn to the general case of Condition (i) in Theorem 5.1, where H has 
finite range interactions but is not induced by a finite range £. For this case we 
shall borrow some concepts from continuum percolation. For A > 0, let "Ha denote a 
homogeneous Poisson point process in M. d with intensity A. Let T-L\ denote the same 
Poisson point process with each point given an independent A^-valued mark with 
the distribution P_m. 

Let A c be the critical value for percolation in d dimensions, that is, the supremum 
of the set of all A > such that the component of the geometric (Gilbert) graph 
G{%\ U {0}, 1) containing the origin is almost surely finite. It is known (see e.g. 
[18]) that < A c < 00 when d > 2 and A c = 00 when d — 1. 

For nonempty X C M. d , write diam(A') for sup{|x — y\ : x, y G X}. For X* C 
M. d x Ai, write diam(A'*) for diam(7r(A'*)), where n denotes the canonical projection 
from l^xM onto R d . 

Theorem 5.3. Suppose H(X*) is a measurable R-valued function defined for all 
finite X* C K^xM, and is translation invariant. Suppose supp(/) is compact. 
Suppose for some r > that the finite range interaction condition (5.2) holds, and 
suppose f and r satisfy the subcriticality condition 




0(n 1/a ), 



(5.10) 



(5.11) 



T /max < A, 



•ci 



(5.12) 
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Assume (r n )„>i satisfies (5.10), and suppose also that hn(z*) < oo for some n£N, 
and that there exists a constant 7 > such that for all finite non-empty X* C M. d 
we have 

H{X*) < 7 (diain(#*) + card(A'*)) 7 . (5.13) 

Then there exists a G (0, 00) such that (5.5) and (5.9) hold, and h(H) < 00 and if 
b G (0,oo) with h{H)\b, then (5.6) holds. 

Now we turn to condition (ii) in Theorem 5.1. Following [24], we say that a 
closed region A C lR d is a d-dimensional C 1 submanifold-with-boundary of M, d if it 
has a differentiable boundary in the following sense: for every x in the boundary dA 
of A, there is an open U C M d , and a continuously differentiable injection g from U 
to M d , such that G £/ and #(0) = x and #([/ n ([0, 00) x M^ 1 )) = #([/) n A. 

Theorem 5.4. Lei k G N. Suppose H = is induced by a £ which depends 
only on the k nearest neighbours, and for some 7 G (0, 00) suppose we have for all 
(x,X*) that 

|£(x;**)| < 7 (l + R K (x,X*)y. (5.14) 

Suppose also that supp(/) zs either a compact convex region in M. d or a compact d- 
dimensional submanifold-with-boundary ofM. d , and suppose f is bounded away from 
zero on supp(/). Finally suppose that the sequence (r n ) n >i satisfies (5.10), and that 
hH{z*) < 00 for some n G N with n > k. Then there exists a G (0, 00) such that 
(5.5) and (5.9) hold, and h(H) < 00 and if b G (0, 00) with h(H)\b then (5.6) also 
holds. 

We prove Theorems 5.2, 5.3 and 5.4 in Section 11. In proving each of these 
results, we apply Theorem 5.1, and check the CLT condition (5.5) using a general 
CLT from [20], stated below as Theorem 11.1. 

The conclusion that a > in Theorems 5.1-5.4 and Corollary 5.1 is noteworthy 
because the result from [20] on its own does not guarantee this. Our approach to 
showing a > here is related to that given in [1] (and elsewhere) but is more generic. 
A different approach to providing generic variance lower bounds was used in [21] 
and [3] but is less well suited to the present setting. 

6 Applications 

This section contains discussion of some examples of concrete models in stochastic 
geometry, to which the general local central limit theorems presented in Section 5 
are applicable. Further examples where the conditions for these general theorems 
can be verified are discussed in [20, 21, 22, 23]. 



12 



6.1 Further quantities associated with random geometric 
graphs 

Suppose the graph Q(X n ,r n ) is as in Section 4. We assume here that (4.2) holds 
with p > 0. Theorem 5.1 enables us to extend the case p > of Theorem 4.1 to 
non-uniform /. It also yields local CLTs for some graph quantities not covered by 
Theorem 4.1; we now give some examples. 

Number of components for Q(X n ,r n ). This quantity can be written in the form 
H n (X n ), where H(X) is the number of components of the geometric graph Q(X, 1) 
(which clearly has finite range interactions). In the the thermodynamic limit, this 
quantity satisfies the CLT (5.7) (see Theorem 13.26 of [18]). Therefore, Corollary 

5.1 is applicable here and shows that it satisfies the local CLT (5.8). 

Number of components for Q (X n , r n ) isomorphic to a given feasible graph Y. This 
quantity, denoted G* n in Section 4, can be written in the form H n (X n ), with H(X) 
the number of components of G(X, 1) isomorphic to T. Clearly, this H has finite 
range interactions since (5.2) holds for r = 2. Also, it satisfies (5.7) by Theorem 
3.14 of [18]. Therefore we can apply Corollary 5.1 to deduce (5.8) in this case. 

Independence number. The independence number of a finite graph is the maximal 
number k such that there exists a set of k vertices in the graph such that none of 
them are adjacent Clearly this quantity is the sum of the independence numbers 
of the graph's components, and therefore if for X C M d we set H{X) to be the 
independence number of Q(X,r) (also known as the off-line packing number since 
it is the maximum number of balls of radius r/2 that can be packed centred at 
points of X) then H satisfies the finite range interactions condition (5.2) with r = 2. 
Therefore we can apply Theorem 5.3 to derive a local CLT for the independence 
number of Q(X n ,r n ), as follows. 

Theorem 6.1. . Let r > and suppose (5.12) holds. Suppose r n is satisfies (5.10). 
Then if for X C M. d we set H(X) to be the independence number of Q(X,t), then 
there exists a G (0, oo) such that (5.7) holds, and if b G N then (5.8) holds. 

6.2 Germ-grain models 

Consider a coverage process in which each point has an associated mark Tj, the 
Tj (defined for % > 1) being i.i.d. nonnegative random variables with a distribution 
having bounded support (i.e., with P[T$ < K] — 1 for some finite K). Define the 
random coverage process 

~ n :=VJUB{r- l X t] T^. (6.1) 
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For U a finite union of convex sets in R d , let \U\ denote the volume of U (i.e. its 
Lebesgue measure) and let \dU\ denote the surface area of U (i.e. the (d — 1)- 
dimensional Hausdorff measure of its boundary). 

Theorem 6.2. Under the above assumptions, if (5.10) holds then there exists a > 

and a >0 such that n -1 / 2 (|S n | -E |S n |) A^(0,(r 2 ) andn-^ddZ^-E \8E m \) A 
Af(0,a 2 ), and moreover for any b G (0, oo), 

n 1/2 P[|S n | G [u, u + 6)] — o~ ^b(f) 



sup 



1/2, 



->■ 



as n — )■ oo. 



(6.2) 



and 



sup 



n 1/2 P[|<9~„| G [u,u + 6)] -b- x ty 



n l / 2 a 



—7-0 as n — > oo. 



(6.3) 



Proof. The volume |S n | can be viewed as a functional H n (X*), where H{X) = 
H®(X*) with ^((a:,*);^*) given by the volume of that part of the ball centred at 
x with radius given by the associated mark t, which is not covered by any corre- 
sponding ball for some other point x' G X with x' preceding x in the lexicographic 
ordering. Since we assume the support of the distribution of the T; L is bounded, this 
£ has finite range r = 2K. Moreover, it satisfies the polynomial growth bound (5.11) 
so by Theorem 5.2 we get the CLT (5.5) and local CLT (5.6) for any b > (in this 
example h(H) = 0). Thus we have (6.2). 

Turning to the surface area |9S n |, this can also be viewed as a functional H n (X n ) 
for a different H = this time taking £(x; X) to be the uncovered surface area of 
the ball at x, which again has range r = 2K and satisfies (5.11). Hence by Theorem 
5.2. we get the CLT (5.5) and local CLT (5.6) for any b > for this choice of H (in 
this example, again h(H) = 0). Thus we have (6.3). □ 

Remark. The preceding argument still works if the independent balls of random 
radius in the preceding discussion are replaced by independent copies of a random 
compact shape that is almost surely contained in the ball B(K) for some K (cf. 
Section 6.1 of [20]). 

Other functionals for the germ-grain model. When / = fu, the scaled point 
process r n l ^ d X n can be viewed as a uniform point process in a window of side r n 1 ^ d . 
CLTs for a large class of other functionals on germ-grain models in such a window 
are considered in [13], for the Poissonised point process with a Poisson distributed 
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number of points. Since the Poissonised version of Theorems 5.1 and 5.2 should also 
hold, it should be possible to derive local CLTs for many of the quantities considered 
in [13], at least in the case where the grains (i.e., the balls or other shapes attached 
to the random points) are of uniformly bounded diameter. 

6.3 Random sequential adsorption (RSA). 

RSA (on-line packing) is a model of irreversible deposition of particles onto an 
initially empty ci-dimensional surface where particles of fixed finite size arrive se- 
quentially at random locations in an initially empty region A of a rf-dimensional 
space (typically d = 1 or d = 2), and each successive particle is accepted if it does 
not overlap any previously accepted particle. The region A is taken to be compact 
and convex. The locations of successive particles are independent and governed by 
some density / on A. In the present setting, we take the mark space M. to be 
[0, 1] with ¥ M the uniform distribution. Each point x = (x, t) of X* represents 
an incoming particle with arrival time t. The marks determine the order in which 
particles arrive, and two particles at x = (x, t) and y = (y, u) are said to overlap if 
\x — y\ < 1. Let H(X*) denote the number of accepted particles. This choice of H 
clearly has finite range interactions ((5.2) holds for r = 2). 

Then H n (X*) represents the number of accepted particles for the re-scaled marked 
point process r~ l X*\ note that the density / and hence the region A on which the 
particles are deposited, does not vary with n. At least for r n = the central 

limit theorem for H n (X n ) is known to hold; see [22] for the case when A = [0, l] d 
and / = fu and [3] for the extension to the non-uniform case on arbitrary com- 
pact convex A (note that these results do not require the sub-criticality condition 
(5.12) to be satisfied). Thus, the H under consideration here satisfies the condition 
(5.5). Therefore we can apply Theorem 5.1 to obtain a local CLT for the number of 
accepted particles in this model. 

Theorem 6.3. Suppose f has compact convex support and is bounded away from 
zero and infinity on its support. Suppose r n = n~ x l d , and suppose Z n = H n (X*) is 
the number of accepted particles in the rescaled RSA model described above. In other 
words, suppose Z n be the number of accepted particles when RSA is performed on 
X n with distance parameter r n = n~ l l d . Then there is a constant a e (0, oo) such 
that (2.1) holds and for b = 1 and c = n 1 ^ 2 , (2.2) holds. 

It is likely that in the preceding result the condition r n = n~ x l d can be relaxed 
to (4.2) holding with p > 0. We have not checked the details. 

In the infinite input version of RSA with range of interaction r, particles continue 
to arrive until the region A is saturated, and the total number of accepted particles 
is a random variable with its distribution determined by r. A central limit theorem 
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for the (random) total number of accepted particles (in the limit r — > 0) is known to 
hold, at least for / = fu\ see [25]. It would be interesting to know if a corresponding 
local central limit theorem holds here as well. 

6.4 Nearest neighbour functionals 

Many functionals have arisen in the applied literature which can be expressed as 
sums of functionals of ^-nearest neighbours, for such problems as multidimensional 
goodness-of-fit tests [5, 2], multidimensional two-sample tests [14], entropy estima- 
tion of probability distributions [17], dimension estimation [16], and nonparametric 
regression [10]. Functionals considered include: sums of power- weighted nearest 
neighbour distances, sums of logarithmic functions of the nearest-neighbour dis- 
tances, number of nearest-neighbours from the same sample in a two-sample prob- 
lem, and others. Central limit theorems have been obtained explicitly for some of 
these examples [5, 14, 2] and in other cases they can often be derived from more 
general results [1, 20, 21, 7]. Thus, for many of these examples it should be possible 
to check the conditions of Theorem 5.1 (case (ii)). 

We consider just one simple example where Theorem 5.4 is applicable. Suppose 
for some fixed a > that H(X) is the sum of the a-power-weighted nearest neigh- 
bour distances in X (for a = 1 this is known as the total length of the directed 
nearest neighbour graph on X). That is, suppose H(X) = H^(X) with £(x;X) 
given by min{|y — x\ a : y £ X \ {x}}. Then H n (X) = r~ a H(X), and £ clearly 
satisfies (5.14) for some 7, so provided / is supported by a compact convex region in 
M. d or by a compact (/-dimensional submanifold-with-boundary of M d , and provided 
/ is bounded away from zero on its support, Theorem 5.4 is applicable with k — 1. 
Hence in this case there exists a G (0, 00) such that (5.5) and (for any b G (0, 00)) 
(5.6) are valid. 

7 Proof of Theorem 2.1 

Let V, Vi, V2, V3, . . . be independent identically distributed random variables. Define 
ay := ^/Var(V) G [0, oo]. In the case ay = 0, Theorem 2.1 is trivial, so from now 
on in this section, we assume ay > 0. Let b, Ci, c 2 , c 3 , . . . be positive constants with 
hy\b and c n ~ n 1 / 2 as n — > oo. 

We prove Theorem 2.1 first in the special case where Z n = S n , then in the case 
where Z n = Y n + S n , and then in full generality. Before starting we recall a fact 
about characteristic functions. 
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Lemma 7.1. If ay — 00 then for all t 6l, os n -)■ 00 



E exp itn~ 1/2 J^(V) - E [V]) 1 - > 0. 
Proof. See for example Section 3, and in particular the final display, of [26]. □ 



Lemma 7.2. Suppose S n = YTj=i Vj an< ^ a v < °°. Then as n — >■ oo ; 



sup 



c n P[S n G [w,u + 6)] -<T ^ 



U-E5,, 



->■ 



(7.1) 



Proof. First consider the special case with c n = n 1 / 2 . In this case, (7.1) holds by 
the classical local central limit theorem for sums of i.i.d. non-lattice variables with 
finite second moment in the case where hy = (see page 232 of [6], or Theorem 
2.5.4 of [9]), and by the local central limit theorem for sums of i.i.d. lattice variables 
in the case where hy > and b/hy 6 Z (see Theorem XV.5.3 of [11], or Theorem 
2.5.2 of [9]). 

To extend this to the general case with c n ~ n 1//2 , observe first that by the special 
case considered above, n l ^ 2 P[S n G [u,u + b)} remains bounded uniformly in u and 
n, and hence 

sup{| (n 1/2 -c n )P[S n G [u,u + b)} [} = sup 



u<=R 



n 



1/2 



1 



n 



1/2 



P[S n e[u,u + b)]} 



->■ 0. 



(7.2) 



Also, for any K > 1, 



sup 

\x\<Kn 1 /2 



n 



1/2 



1 < (27re)~ 1/2 sup 



x 



< {2-ney 1 / 2 



1 - 



n l/2 
n l/2 



->■ 0. 



(7.3) 



Also, for large enough n, 

sup max 

and since K is arbitrarily large, combined with (7.3), this shows that 



sup 



X 



71 



1/2 



->■ 0. 



Combined with (7.2), this shows that we can deduce (7.1) for general c n satisfying 
c n ~ n 1 / 2 from the special case with c n = n 1 / 2 which was established earlier. □ 
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Lemma 7.3. Theorem 2.1 holds in the special case where Z n = Y n + S n . 



Proof. Assume, along with the hypotheses of Theorem 2.1, that Z n = Y n + S n . 
Considering characteristic functions, by (2.1) we have for t G R that 



E [exp (itn- 1/2 (Y n -EY n ))] E [exp (it n - 1/2 (S n -ES n ))] 

— > exp(— \to- 2 ). 



(7.4) 



If ay = oo then by Lemma 7.1, the second factor in the left hand side of (7.4) tends 
to zero, giving a contradiction. Hence we may assume ay < oo from now on. 
By the Central Limit Theorem, 



n 



~ 1/2 (S n -ES n ) N(0,ay) 



(7.5) 



By (7.4) and (7.5), a v < a 2 and setting a\ := a 2 -ay > 0, we have that n l/2 (Y n 
EY n ) is asymptotically jV(0, cry). Hence, 



c-\Y n -EY n ) AaT(0,4). 



(7.6) 



That is, (2.3) holds. 
Let u E R and set 



t := t(u,n) := c n l {u — EZ n ). 
Assume that Z n = Y n + S n . By independence of Y n and S n , 

P[Z n E[u,u + b)]= P[c-\Z n -E[Z n ]) ec- l [u-EZ n ,u + b-EZ n )] 
' ' :ZY n e d J p P»~ ElS " e c -i[ u — E Z n , u + b — E Z n ) — x 



(7.7) 



so that 



/oo 
p 



G ax 



x (c n P [S n - E S n E [u — EZ n — xc n , u — EZ n — xc n + b)]) 



P 



Y - EY 



(c n P [S n -ES n e [(t- x)c n , (t - x)c n + b)}) 



By Lemma 7.2, 



c n P [S n -ES n e [yc n , yc n + b)] 



0~v \0~v 



+ 9n{y) 
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where 



sup \g n (y)\ — > as n — >■ oo. 



(7.* 



Hence, 

c n P[Z n G [u,u + 6)] = E 



6 ft-c-\Y n -EY n ) 

— c 

o"y 



cry 



+ <?n ({-^(yn-EK)) 



so by (7.8), to prove (2.2), it suffices to prove 
_ lA ft(u,n)-c- 1 {Y n -EY n ) 



sup 



E 



'v 



-<T"V 



->■ 0. 



(7.9) 



Suppose this fails. Then there is a strictly increasing sequence of natural numbers 
(n(m),m > 1) and a sequence of real numbers (u m ,m > 1) such that with t m := 
t(u m ,n(m)), we have 



lim inf 

m— >oo 



E 



<7, 



-Ij, I m C n(m)(^"( m ) Ey„( m ) y 



\il ^"in E Z n ( m ^ 



> 0. 



(7.10) 



By taking a subsequence if necessary, we may assume without loss of generality, 
either that t m — > t for some t G R, or that or that |i m | — > oo as m — > oo. Consider 
first the latter case. If |t m | — > oo as m — > oo, then by (7.6), 

^0*m - C nL(^M - E ^M)I < l*m|/2] < ^[^^(^(m) " EY n(m) )\ > \t m \/2] 



^0, 



and hence 



E 



-Ij. I m C n(m)(^"( m ) Ey„( m ) y 



->■ 0. 



Since c n L(«* m - EZ„( m )) is equal to t m by (7.7), we also have under this assumption 
that a~ V (m) tends to zero, and thus we obtain a contradiction of (7.10). 

In the case where t m — > t for some finite t, we have by (7.6) that t m —c~}JY n ( m ) — 
Ey n ( m )) converges in distribution to t — Wi, where W\ ~ A/"(0, <7y). Hence as 
m — ?■ oo, 



E 



a 



_ lA (tm C „( m )(^n(m) EF n ( m )) 

0"y 



= E/^ 2 (t-H/ 1 ), 
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where W2 ~ N(0,o~y), with probability density function fw 2 ( x ) '■= °'v 1( t > { x l (7 v)- If 
we assume Wi, Wi are independent, then E fw 2 {t ~ Wi) is the convolution formula 
for the probability density function of W\ + W2, which is jV(0, a 2 ), so that 

Ef W2 (t - W 1 ) = f Wl+Wa (t) = a-^t/a). 

On the other hand, since c~^(u m — EZ„( m )) is equal (by (7.7)) to t m which we 
assume converges to t, we also have that 



it. 



EZ 



n(m) 



and therefore we obtain a contradiction of (7.10) in this case too. 

Thus (7.10) fails, and therefore (7.9) holds. Hence, (2.2) holds in the case with 
Z n = Y n + S n . □ 



Proof of Theorem 2.1. Set Z' n := Y n + S n . By the integrability assumptions, Z' n 
is integrable. By (2.1) and the assumption that n _1 / 2 E [\Z n — Z' n \] — > as n — > 00, 



n 



- 1/2 (Z'-EZ' 



v 



A/"(0,cr 5 



as n — > 00. 



(7.11) 



Let 6 > with By Lemma 7.3, a 2 > VaxV and (2.3) holds and 



sup 



c„P[Z; G [u,u + 6)] - (7 x 60 



u-EZ> n 



->■ 



as n — )■ 00. 



Hence, by the assumption n 1 / 2 P[Z n 7^ Z^] — >■ 0, 

'u-RZL 



sup 



c n P[Z„ G [u, u + fc)] — o" 1 b(f) 



->■ 



as n — )> 00, 



and since the assumption n" 1 /^ [\Z n -Z' n \] ->■ implies that c'^E [Z„]-E [Z^]) ->■ 
as n — )• 00, and is uniformly continuous on R, we can then deduce (2.2). □ 



8 Proof of theorems for percolation 

We shall repeatedly use the following Chernoff-type tail bounds for the binomial 
and Poisson distributions For a > set <p(a) := 1 — a + a log a. Then ip{\) = and 
(p(a) > for a G (0, 00) \ {1}. 

Lemma 8.1. If X is a binomial or Poisson distributed random variable with E [X] = 
/i > 0. Then we have for all x > that 

P[X >x]< exp(-fjL(p(x/n)), x>fi; (8.1) 
P[X <x}< exp{-fi V {x/fi)), x<fi. (8.2) 
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Proof. See e.g. Lemmas 1.1 and 1.2 of [18]. 



□ 



Proof of Theorem 3.1. Let (-B n )„>i be a sequence of nonempty finite subsets in Z d 
with vanishing relative boundary. The first conclusion (3.2) follows from Theorem 
3.1 of [19], so it remains to prove (3.3). 

For x G Z d let ||x||oo denote the foo-norm of x, i.e., the maximum absolute value 
of its coordinates. Let B° be the set of points x in B n such that all y G Z d with \\y — 
x\\oo < 1 are also in B n . Since \B n \ B°\/\dB n \ is bounded by a constant depending 
only on d, the vanishing relative boundary condition (3.1) implies |5°|/|5 n | — > 1 as 
n — > oo. 

Hence, by the pigeonhole principle, for all large enough n we can choose a set 
of points x nt i, x Uj 2, ■ ■ ■ , ^n,L5" d |s„|/2j i n B° such that \\x n j — £ n ,fc||oo > 3 for each 
distinct j, k in {1, 2, ... , [_5 -d |-B n |/2j } (let these points be chosen by some arbitrary 
deterministic rule). 

For 1 < j < |_5~ d |-B n |/2j , let I n j be the indicator of the event that each vertex 
y G Z d with \\y — XnjWoo = 1 is closed, and list the j for which I n j = 1, in increasing 

order, as J(n, 1) . . . , J(n, N n ), where N n := Y^jLi ' B "' //2 ^ I n ,j- Let I' n j be the indicator 
of the event that the vertex x n j is itself open. Then N n is binomially distributed 
with parameter (1 — p) 3 *" 1 , so by Lemma 8.1, 

limsup iB^-MogP^ < 5~ d (l -pf-^B^/A] < 0. (8.3) 

Set b n := [5~ d (l — p) 3d_1 |-B n |/4j . Let Vi,V 2 , . . . be a sequence of independent 
Bernoulli variables with parameter p, independent of everything else. Recalling 
that A(B) denotes the number of open clusters in B, set 

min(6 n ,iV„) 

S'n := l'n,J(n,j)i '■= A(B n ) - S' n , 

and 

(bn-N n ) + 

S n :=S' n + V v 

3=1 

where x + := max(x, 0) as usual, and the sum Y^=i ^ s taken to be zero. 

In this case, the 'good boxes' discussed in Section 1 are the unit ^oo-neighbourhoods 
of the sites z n ,j(n,i), £n,j(n,2), • ■ ■x n ,j(n,mm(b n ,N n ))- If x n j is at the centre of a good 
box, it is (if open) isolated from other open sites, so that Y n is simply the number 
of open clusters in B n if one ignores all sites x nt jr n j\ (1 < j < min(fe n , N n )). Hence 
Y n does not affect the open/closed status of these sites. 
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Thus S n has the Bin(6 n ,p) distribution and its distribution, given Y n , is unaf- 
fected by the value of Y n so S n is independent of Y n . Also, 

(b n -N n )+ 

A(B n )-(Y n + S n ) = S' n -S n = - V i 

3=1 

so that by (8.3), both \B n \ l / 2 P[X(B n ) ± Y n + S n ] and \B n \-V 2 E [\A(B n ) - (Y n + S n )\\ 
tend to zero as n — > oo. Combined with (3.2) this shows that Theorem 2.1 is appli- 
cable, with hy = 1, and that result shows that (3.3) holds. □ 

In the proof of Theorem 3.2, and again later on, we shall use the following. 

Lemma 8.2. Suppose £i, . . . , £ m are independent identically distributed random el- 
ements of some measurable space (E,£). Suppose m e N and ip : E m — > R is 
measurable and suppose for some finite K that for j — 1, . . . , m, 

K> sup . . . ,Xj, . . . ,x m ) - i/j(xi, . . . ,x'j, . . . ,x m )\. 

(x 1 ,...,x m ,x' j )eE m + 1 

Set Y = ?p(£,i, . . . , £ m ). Then for any t > 0, 

P[\Y-EY\ >t}< 2exp(-t 2 /(2mK 2 )). 

Proof. The argument is similar to e.g. the proof of Theorem 3.15 of [18]; we 
include it for completeness. For 1 < i < m let T% be the a-algebra generated 
by £i, • • • , £i, and let J-q be the trivial a- algebra. Then Y — E [Y] = Di with 
Di := E [Vl-Fi] — E [Y\ J-i-i], the ith martingale difference. Then with ^ independent 
of £i, . . . , £ m with the same distribution as them, we have 

so that < K almost surely and hence by Azuma's inequality (see e.g. [18]) we 
have the result. 

Proof of Theorem 3.2. Assume d > 2 and p > p c (d). Let (B n ) n >i be a cube-like 
sequence of lattice boxes in Z d . For finite nonempty A C Z d we define the diameter 
of A, written diam(A), to be max{||x — yW^ : x E A,y <E A}. 

Set 7 n := [diam(5 n ) 1 ^ 4 ^] . Let -B" 1 be the set of points x in B n such that all 
y G Z d with ||y — x||oo < 7 ra are also in B n . Then we claim that |B™|/|B n | -)• 1 as 
n — > oo. Indeed, writing B n = rij=i([ — a j>> HZ), from the cube-like condition 
(3.4) we have for 1 < j < d that 7 n = o[a^ n + bj >n ) as n — > oo, and therefore 

d d 

ra = n^> + a ^ - = (! + °w) n^> + 

J=l 3=1 
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justifying the claim. 

By the preceding claim, and the pigeonhole principle, for all large enough n 
there is a deterministic set of points x n ^ x n p, ■ ■ ■ , x n ,\5- d \B n \/2\ in B l £ such that 
\\x n ,j — ^n,fe||oo > 3 for each distinct j, k in {1, 2, ... , L5~ d |-B n |/2j }. 

For 1 < j < |_5 _d |.B n |/2j , let I n j be the indicator of the event that (i) each vertex 
y G Ij d with \\y — £n,j||oo = 1 is open, and (ii) the open cluster in B n containing all 
y G Z d with \\y — £ nj ||oo — 1 has diameter at least 7„. 

Set m{n) := \_5~ d p 3 ~ 1 9d(p)\B n \/8\ , with 6 dip) denoting the percolation probabil- 
ity. List the j for which I n j = 1 as J(n, 1), . . . , J(n, N n ), with N n := Ylj=i ' Bn '^ 2 ^ I n ,j- 
Then we have for n large that 

E[N n ] > [5- d \B n \/2\p 3fI - 1 e d ( P ) > 2m(n). 

Changing the open/closed status of a single site z in B n can change the value of I n j 
only for those j for which \\x n j — zW^ < j n , and the number of such j is at most 
(27„ + l) d . Moreover, for n large 

(27„ + l) d < (2(diam5 n ) 1 /^ + 3) d < 3 <i (diam J B n ) 1 / 4 < ^{B^ 4 

so that the total change in N n due to changing the status of a single site z is at 
most 3 d \B n \ 1 / 4 . So by Lemma 8.2, 

P[N n < m{n)\ < P[\N n - EN n \ > m{n)\ < 2exp 

and hence 

limsup \B n \- 1/2 log P[N n < m(n)\ < 0. (8.4) 

n— >oo 

Let Vi, V2, ■ ■ ■ be a sequence of independent Bernoulli variables with parameter p, 
independent of everything else. For 1 < j ' < |_5~ d |.B n |/2j , let I' n j be the indicator of 
the event that the vertex x n j is open. Set 

(m(n)-N n )+ 

S n :=S' n + V r 

Let Y n be the size of the largest open cluster in B n if the status of x,i >n is set to 
'closed' for the first min(m(n), N n ) values of j for which I n j = 1. 

Then S n has the Bin(m(n),p) distribution and we assert that its distribution, 
given Y n , is unaffected by the value of Y n so S n is independent of Y n . Indeed, Y n 
is obtained without sampling the status of the sites x n j for the first min(m(n), iV n ) 
values of j for which I n j = 1. 



m(n)' 



2\B n \(3 d \B n \^y 



min(m(n),7V n ) 

s" ■= V 

n ' / j n,J(n,j) 
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To explain this further, consider algorithmically sampling the open/closed status 
of sites in B n as follows. First sample the status of sites outside Uj{x n j}. Then 
sample the status of those x n j for which the ^-neighbouring sites are not all open 
(for these sites, I n j must be zero). At this stage, it remains to sample the status of 
sites x n j for which the ^-neighbouring sites are all open, and for these sites one can 
tell, without revealing the value of x n j, whether or not I n j = 1 (and in particular 
one can determine the value of N n ). At the next step sample the status of all x n ^ 
except for the first mm(N n ,m(n)) values of % which have I n j = 1. At this point, 
the value of Y n is determined. However, the value of S n is determined by the status 
of the remaining unsampled sites together with some extra Bernoulli variables in 
the case where N n < m(n), so its distribution is independent of the value of Y n as 
asserted. 

Next, we establish that L(B n ) = Y n + S n with high probability. One way in 
which this could fail would be if N n < m(n), but we know from (8.4) that this 
has small probability. Also, we claim that with high probability, all sites x n j for 
which I n j = 1 have all their neighbouring sites as part of the largest open cluster, 
regardless of the status of x n ^. To see this, let A n be the event that (i) there is a 
unique open cluster for B n that crosses B n in all directions (in the sense of [19]) and 
(ii) all other clusters in B n have diameter less than j n . Then we claim that -PL4^] 
decays exponentially in 7„ in the sense that 

lim sup(diam J B n ) 1/(4d) log P[A c n ] < 0. (8.5) 

n— s-oo 

The proof of (8.5) proceeds as in proof of Lemma 3.4 of [19]; we include a sketch of 
this argument here for completeness. 

First suppose d = 2. For a given rectangle of dimensions (7 n /3) x 7„, the 
probability that it fails to have an open crossing the long way decays exponentially 
in 7„ (see Lemma 3.1 of [19]). Consider the family of all rectangles of dimensions 
(7n/3) x 7„ or of dimensions 7„ x (7 n /3), with all corners in (7„/3)Z 2 , having non- 
empty intersection with B n . The number of such rectangles is 0(diam(.B n ) (i_1 / 2 ). 
By the preceding probability estimate, all rectangles in this family have an open 
crossing the long way, except on an event of probablity decaying exponentially in 
7 n . However, if all these rectangles have an open crossing the long way, then event 
A n occurs and we have justified (8.5) for d = 2. 

For d > 3, by the well known result of Grimmett and Marstrand [12], there exists 
a finite K such that there is an infinite open cluster in the slab [0, K] x W 1 " 1 with 
strictly positive probability. By dividing B n into slabs of thickness K we see for 
1 < % < d that the probabilty that there is no open crossing of B n in the i-direction 
decays exponentially in diam(S n ). Moreover, for % ^ j, by a similar slab argument 
(consider successive slabs of thickness K in the i direction), the probability that 
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there is an open cluster in B n that crosses B n in the % direction but not the j 
direction decays exponentially in diam(5 n ). Similarly the probability that there 
are two or more disjoint open clusters in B n which cross in the % direction decays 
exponentially in n. Finally by a further slab argument, the probability that there is 
an open cluster which has diameter at least j n / d in the i direction but fails to cross 
the whole of B n in the j direction, decreases exponentially in j n . This justifies (8.5) 
for d > 3. 

Note that the occurrence or otherwise of A n is unaffected by the open/closed 
status of those x n j for which I n j = 1. Also, for large enough n, on event A n , 
whatever status we give to these x n j, the unique crossing cluster is the largest one 
because it has at least diam(_B n ) elements while all other clusters have at most 
0(diam(i? n ) 1/ ' 4 ) elements. 

If N n > m(n) and event A n occurs, then for each j < m(n), the site x n j( n ,j) is 
in the largest open cluster if and only if it is open, since if it is open then it is in an 
open cluster of diameter at least 7„. This shows that if N n > m(n) and event A n 
occurs, we do indeed have L(B n ) = Y n + S n . Together with the previous probability 
estimates (8.4) and (8.5), this shows that \B n \^ 2 P[L(B n ) ^ Y n + S n ] ->■ as n — > oo. 
Moreover, by the Cauchy-Schwarz inequality, 

E [\L(B n ) - (Y n + S n )\] = E [\L(B n ) - (Y n + S n )\l 

< (P[N n < m{n)\ + ?K]) 1/2 (E [(L(B n ) - (Y n + Sn)) 2 ]) 1 / 2 
< (P[N n < m(n)\ + P[^]) 1/2 (|5n| + m(n)) -+ 0. 

By Theorem 3.2 of [19], the first conclusion (3.5) holds, and by the preceding discus- 
sion, we can then apply Theorem 2.1 with hy — 1, to derive the second conclusion 
(3.6). □ 

9 Proof of Theorem 4.1 

We are now in the setting of Section 4. Assume / = fy, and fix a feasible connected 
graph T with k vertices (2 < k < oo). Assume also that the sequence (r„) n >i is 
given and satisifies (4.2) and (4.3). Then P[Q{X k ,1/{k + Z)) ~ V] e (0,1). Let 
Q n ,i, Qn,2, ■ ■ ■ , Qn,m(n) be disjoint cubes of side (/t + 5)r„, contained in the unit cube, 
with m(n) ~ ((« + 5)r n )~ d as n — > oo. For 1 < j < m(n), let I n j be the indicator 
of the event that X n D Q n j consists of exactly k points, all of them at a Euclidean 
distance greater than r n from the boundary of Q n j. List the indices j < m(n) such 
that I n j — 1, in increasing order, as J n ,i, . . . , J n ,N n , with N n := In,j- Then 

E [N n ] = m{n){{K + 3)/(k + 5)) dK P[Bin(n, ((« + 5)r n ) d ) = «], (9.1) 



25 



and hence as n — > oo, since nr d is bounded by our assumption (4.2), 

E [7V n ] ~ K \-\ K + 3) dK (K + 5)~ d n K r% K ~ 1} exp(-n(K + 5) d r d ). (9.2) 

Recalling from (4.3) that r n := A/n(nr^) K_1 , we can rewrite (9.2) as 

E [iV n ] ~ kI-^k + 3) dK (K + 5)^ exp(-7i(/c + 5) d r d ) (9.3) 

as n — > oo. Moreover, for the Poissonised version of this model where the number 
of points is Poisson distributed with mean n, we have the same asymptotics for the 
quantity corresponding to N n (the binomial probability in (9.1) is asymptotic to the 
corresponding Poisson probability). Set a to be one-quarter of the coefficient of 
in (9.3), if the exponential factor is replaced by its smallest value in the sequence, 
i.e. set 

a := (4d)- 1 (/t + 3) dK (/t + 5)- d infexp(-n(/t + 5)^). (9.4) 

n 

Then a > by our assumption (4.2) on r n . 
Lemma 9.1. It is the case that 

lim sup r r 7 2 log P [N n < ar^\ < 0. 

Proof. Let 5 > (to be chosen later). Let M n be Poisson distributed with 

parameter (1 — 5)n, independent of the sequence of random d- vectors Xi,X 2 , 

Define the Poisson point process 

V n (iS) := {X 1 , . . . ,X Mn }- 

Let N' n be defined in the same manner as N n but in terms of V n (\s) rather than 
X n . That is, set 

m(n) 

N' := V I' 

i=i 

with I' n j denoting the indicator of the event that P n (i-5) H Q n j consists of exactly 
k points, all at distance greater than r n from the boundary of Q n j. List the indices 
j < M n such that I' n ^ = 1 as J' n l , . . . , J' n Nk . 

Since (9.3) holds in the Poisson setting too, using the definition of r n we have as 
n — )> oo that 

E [N' n \ ~ k\-\k + 3) dK (K + 5y d (l - 5) k tI exp(-n(l -6)(k + 5) d r d ). (9.5) 



26 



By (9.3) and (9.5), we can and do choose 5 > to be small enough so that EN n > 
(3/4)EiV n for large n. 

By (9.3) and (9.4) we have for large n that 2ar% < (5/8)E JV n . Also, N' n is bi- 
nomially distributed, and hence by Lemma 8.1, P[N' n < 2ar^\ decays exponentially 
in 

By Lemma 8.1, except on an event of probability decaying exponentially in n, 
the value of M n lies between n(l — 25) and n. If this happens, the discrepancy 
between N n and N' n is due to the addition of at most an extra 2Sn points to P n (i-5) • 
If also N' n > 2ar% then to have N n < ar%, at least ar^ of the added points must 
land in the union of the first \2ar^\ cubes contributing to N' n . 

To spell out the preceding argument in more detail, let 1 < j < m(n). If 
M n < n and I' n ^ — 1 and ^ Q n j for M n < k < n, then I n j = 1, since in this 
case X n fl Q n j = V n (\s) H Q n j. Therefore if M n < n and N' n > 2ar% and 

n 

l{X k eU^Q n ^J<arl 

k=M n +l 

then X n fl Q n ,j' n . 7^ P n (i-S) H Q n ,j' n . for at most [ar n \ values of j G [l^ccr^], and 
hence 

Hence, if n(l-25) < M n < n and N' n > 2ar 2 n and ESS 1 l{^fc e U^T" 1 ^,./;,.} < 
ar n , then iV n > ar^. Hence 

P[iV n < crr^A^ > 2ar^, n - 25n < M n < n] 
< P[Bm(\25n], \2ar^((K + 5)r n ) d ) > ar 2 n \. 

Since nr^ is assumed bounded, we can choose 8 small enough so that the expecta- 
tion of the binomial variable in the last line is less than (a/2)r n l , and then appeal 
once more to Lemma 8.1 to see that the above conditional probability decays ex- 
ponentially in t%- Combining all these probability estimates give the desired result. □ 

Proof of Theorem 4.1. Set p := P[G(X K , 1/(k + 3)) ~ T]. Let V u V 2 , ... be a 
sequence of independent Bernoulli variables with parameter p, independent of X n . 
Let 

min(Lar2j,Ar n ) 

S' n := l{6(^nQ MM ;r B )~r}; Y n :=G n -S' n , 
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and 



S n := S' n + ^ Vj, 

3=1 

where x + := max(x, 0) as usual, and the sum Yl°j=i i s taken to be zero. 

For each j, given that I n j = 1, the distribution of the contribution to G n from 
points in Q n j is Bernoulli with parameter P[Q((k + 3)r n X K ,r n ) ~ T], which is p. 
Hence S n is binomial Bin([a;r^J,p). Moreover, the conditional distribution of S n , 
given the value of Y n , does not depend on the value of Y n , and therefore S n is 
independent of Y n . By (4.5), 

[ar 2 ] _1/2 (G n -EG„) A JV(0, oTV) . 



Moreover, 

E[|G n -(Y n + S n ) 



E 



([ar2j-JV n )H 



< P K 2 jP[iV n <ar n 2 ] 



so that by Lemma 9.1, both r n P[G n ^ Y n + S'n] and r~ 1 E[|G n — Y n — S^l] tend 
to zero as n — > oo. Hence, Theorem 2.1 (with hy = 1) is applicable, with \o.r 2 \ 
playing the role of n in that result and a 1//2 r n playing the role of c„, yielding 



sup 



a 1/2 r n P[G n = k] - a^a' 1 ^ 



k-EGr. 



as n — > oo. Multiplying through by a" 1//2 yields (4.6). 



^0, 



□ 



10 Proof of Theorem 5.1 

Recall the definition of hx (the span of X) from Section 2. 

Lemma 10.1. If X and Y are independent random variables then hx+y\hx- 

Proof. If hx+Y = there is nothing to prove. Otherwise, set h = hx+y- Then, 
considering characteristic functions, observe that 

1 = |E exp(27ri(X + Y)/h)\ = |E exp(27viX / h)\ x |E exp(27riF//i)| 

so that |E ex. i p(2mX / h)\ = 1 and hence h\h X - □ 
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We are in the setup of Section 5. Recall that the point process Z n consists 
of n normally distributed marked points in M. d , while U Uj k consists of n uniformly 
distributed marked points in B(K). Set h n> K '■= hn(u* K )- Set h n := hn(z*), and 
recall from (5.4) that h(H) := liminf^oo h n . 

Lemma 10.2. Suppose either (i) H has finite range interactions and hutz*) < 00 
for some n, or (ii) H = is induced by a n-nearest neighbour functional £(x; X*), 
and hu(z*) < °o for some n > k. Then h(H) < oo, and if h(H) > 0, there exists 
fi G N and K > such that h^x = h(H). If h(H) = 0, then for any e > there 
exists ji G N and K > such that h^x < £■ In case (ii), we can take /i such that 
additionally /i> k + 1. 

Proof. The support of the distribution of H(U* K ) is increasing with K, so 
hn,K'\h n ,K for K' > K. Hence, there exists a limit /i„ i00 such that 

K )00 = lim hnjc (10.1) 

K— >oo 

and also we have the implication 

h n ,oo > =>• : h nt K = h n)00 . (10.2) 

Also, for all K the support of the distribution of H(U* K ) is contained in the support 
of H(Z*), so that 

h n = h H (z*) < K,k, Vif, (10.3) 

and hence h n < h nt00 for all n. We assert that in fact 

K,oc = K. (10.4) 

This is clear when h n)OQ = 0. When h n>00 > 0, there exists a countable set S with 
span h ritOQ such that P[H(U* K ) G S] — 1 for all K. But then it is easily deduced 
that P[H(Z*) G S] — 1, so that h n > h noo , and combined with (10.3) this gives 
(10.4). 

We shall show in both cases (i) and (ii) that h n tends to a finite limit; that is, 
for both cases we shall show that 

h(H) = lim h n = lim h noo < oo. (10.5) 

Also, we show in both cases that 

h(H) > 3n G N : h n = h(H) Vn > n . (10.6) 
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If h(H) > 0, the desired conclusion follows from (10.6), (10.4) and (10.2). If h(H) = 
0, the desired conclusion follows from (10.5) and (10.1). 

Consider the case (i), where H has finite range interactions. In this case, we 
shall show that for all n, 

h n+1 \h n , (10.7) 

and since we assume h n < oo for some n, (10.7) clearly implies (10.5) and (10.6). 

We now demonstrate (10.7) in case (i) as follows. By (10.4) and (10.2), to prove 
(10.7) it suffices to prove that h n+ i\h n ^ K for all K. Choose r such that (5.2) holds. 
There is a strictly positive probability that the first n points of Z n lie in B(K) while 
the last one lies outside BiK + r). Hence by (5.2) and translation-invariance, the 
support of the distribution of H(Z* +1 ) contains the support of the distribution of 
H(U* K ) + H({(0, T)}), where T is a Px -distributed element of M, independent of 
U* K . Hence by Lemma 10.1, h n+1 \h njK , so (10.7) holds as claimed in this case. 

Now consider case (ii), where we assume H = with £(x; X) determined by 
the k nearest neighbours. We claim that if j > k + 1 and £ > k + 1 then 

hj + e\hj and hj + e\he. (10.8) 

By (10.2) and (10.4), to verify (10.8) it suffices to show that 

h j+e \h j:K \/K > 0. (10.9) 

Given K, let B and B' be disjoint balls of radius K, distant more than 2K from 
each other. There is a positive probability that Zj+e consists of j points in B and 
£ points in B', and if this happens then (since we assume min(j, £) > k)) the n 
nearest neighbours of the points in B are also in B, while the k nearest neighbours 
of the points in B' are also in B', so that H(Z* +e ) is the sum of conditionally 
independent contributions from the points in B and those in B'. Hence the support 
of the distribution of H(Z* +e ) contains the support of the distribution of H{U* K ) + 
H(U^ K ), where R{U* lK ) is defined to be a variable with the distribution of H(U^ K ) 
independent of H(U* K ). Then (10.9) follows from Lemma 10.1. 
Define 

h! = inf h n . 

n>K+l 

Then for all e > we can pick j > k + 1 with hj < h' + e, and then by (10.8) we 
have he < h' + e for £ > j + k + 1. This demonstrates (10.5) for this case (with 
h(H) = h r ), since we assume h n < oo for some n. Moreover, if h(H) > 0, then in 
the argument just given we can take e < h(H) and then for £ > j + k + 1 we must 
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have he\hj, which can happen only if he = hj, so by (10.5), in fact he = hj = h(H). 
That is, we also have (10.6) for this case. □ 

Since we are in the setting of Section 5, we assume (as in Section 4) that / is an 
almost everywhere continuous probability density function on M d with / max < oo. 
The point process X n C R d is a sample from this density, and the marked point 
process X* C M. d x M. is obtained by giving each point of X n a P^-distributed 
mark. Recall also that we are given a sequence (r n ) with p := lim^oo nr d G (0, oo). 
Recall from (5.1) that H n (X*) := H(r~ 1 X*') for a given translation-invariant H . 

Our strategy for proving Theorem 5.1 goes as follows. First we choose p, K as 
in Lemma 10.2. Then we choose constants (3 > K and m > p in a certain way (see 
below), and use the continuity of / to pick Q(n) disjoint deterministic balls of radius 
(3r n such that / is positive and almost constant on each of these balls. We use a form 
of rejection sampling to make the density of points of X n in each (unrejected) ball 
uniform. We also reject all balls which do not contain exactly m points of X n in a 
certain 'good' configuration (of non- vanishing probability). The definition of 'good' 
is chosen in such a way that the contribution to H n from inside an inner ball of radius 
Kr n is shielded from everything outside the outer ball of radius (3r n . We end up with 
Q(n) (in probability) unrejected balls, and the contributions to H n (X*) from the 
corresponding inner balls are independent (because of the shielding) and identically 
distributed (because of the uniformly distributed points) so the sum contribution of 
these inner balls can play the role of S n in Theorem 2.1. 

In the proof of Theorem 5.1, we need to consider certain functions, sets and 
sequences, defined for /3 > 0. For x G M. d with f(x) > 0, define the function 

/x m ^{f(y) '■ U B(x; f3r n )} /mirrt 
:= M ): V eB( Xi Pr n) y (1 °- 10) 
and for x G W 1 with f(x) > and gn,p{x) > 0, and z G B(x; /3r n ), define 

fclJ (v),^|«. (10.11) 

Since we assume / is almost everywhere continuous, the function g n ^ converges 
almost everywhere on {x : f(x) > 0} to 1. By Egorov's theorem (see e.g. [9]), given 
f3 > there is a set Ap with f A ^ f(x)dx > 1/2, such that f(x) is bounded away 
from zero on Ap and g n ^(x) — > 1 uniformly on Ap. 

Since we assume (4.2) with p > here, for n large enough nr d < 2p. Set 

71(0) := 2-^Wr d f- 1 L P - 1 - 

Given j3 > 0, we claim that for n large enough so that nr d < 2p, we can (and 
do) choose points x p>nil , . . . , xp :7ljW)ni in Ap with \xp^ - x M \ > 2/3r n for 1 < 
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j < k < [r)((3)n\. To see this we use a measure-theoretic version of the pigeonhole 
principle, as follows. Suppose inductively that we have chosen xp^i, . . . , xp tnjk , with 
k < \j]((3)n\. Then let xp iUjk+ i be the first point, according to the lexicographic 
ordering, in the set Ap \ L) 1 j =1 B(xp tn j] 2/3r„). This is possible, because this set is 
non-empty, because by subadditivity of measure, 

/ f(x)dx < ku d (2/3r n ) d f max < v(P)nco d (2/3r n ) d f max 

= nr d J(Ap) < 1/2 < / f(x)dx, 
Ja 

justifying the claim. Define the ball 

Bf3, n ,j ■= B(x/3,n,j, Pr n ); Bp tnJ := B(xp, nd , (3r n ) x M. 

The balls Bp >n ^ . . . , S^n^H are disjoint. 

Let W\, W2, W3, ... be uniformly distributed random variables in [0, 1], indepen- 
dent of each other and of (Xj)™ =1 , where Xj = (Xj,Tj). For k e N, think of W k 
as an extra mark attached to the point This is used in the rejection sampling 
procedure. Given j3, if X k e Bp ;7l j, let us say that the point X k is (3-red if the 
associated mark W k is less than p n ,/3( x /3,n,j, X k ). Given that X k lies in Bp n j and is 
/3-red, the conditional distribution of X k is uniform over Bp^ n j. 

Now let m G N, and suppose S is a measurable set of configurations of m points in 
B{0) such that P[U mj p G S] > 0. The number m and the set S will be chosen so that 
given there are m points of X n in ball Bp >n j, and given their rescaled configuration 
of lies in the set S, there is a subset of these m points which are 'shielded' from the 
rest of X n . 

Given S (and by implication j3 and m), for 1 < j < [i](f3)n\, let Is,n,j be the 
indicator of the event that the following conditions hold: 

• The point set X n fl Bp ;7l j consists of m points, all of them /3-red; 

• The configuration r~ l (-xp^j + {X n H Bp, nJ )) is in S. 

Let Ns^ n := 5^ j=f Is,n,j, and list the i for which Is,n,j = 1 in increasing order as 
J(S,n, 1) . . . , J(S,n,N s , n )- 

Lemma 10.3. Let f3 > 0, and m G N. Let S be a measurable set of configurations 
of m points in B((3) such that P\U m ^ G S] > 0. Then: (i) there exists 5 > such 
that 

limsup (n" 1 logPLV 5 , n < Sn\) < 0, (10.12) 
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and (ii) conditional on the values of Is, n ,i f or 1 < z < [v(P) n \ an d ^ e configuration 
of X n outside B^ n j(s, n ,\) U • • • U Bp^ J( s^ Nsn ), the joint distribution of the point 
sets 

^ 1 (-^, n ,j(5, n ,i)+(A;nB j 3 ini j( Sin] i))), . . . ,r~ 1 (-xp^ J { S ^ Nsn ) + (X n nBp^j {s ^ Nsn) )) 
is that of Ns, n independent copies ofU m ,/3 each conditioned to be in S. 

Proof. Consider first the asymptotics for E [Afe^] . Given a finite point set 
X C M. d and a set B C M. d , let X(B) denote the number of points of X in B. Fix 
m. Since / is bounded away from zero and infinity on Ap and g n ^ — > 1 uniformly 
on Ap, we have uniformly over x G Ap that 

n [ f(y)dy = nf(x) [ (f(y)/f(x))dy d u d pf(x) 

JB{x;f5r n ) J B(x;/3r n ) 

Hence by binomial approximation to Poisson, 

P[X n (B(x] pr n )) —m\—> : as n — > oo, 

and this convergence is also uniform over x G Ap. 

Given m points X k in Bp^ n j, the probability that these are all /3-red is at least 
9n,p{.x) m so exceeds | if n is large enough, since g Uj p — > 1 uniformly on Ap. 

Given that m of the points X k lie in Bp^ n j, and given that they are all /3-red, 
their spatial locations are independently uniformly distributed over Bp^ n j; hence the 
conditional probability that r~ 1 (—xp jn j + (X n r\Bp n j)) lies in S is a strictly positive 
constant. 

These arguments show that liminf^oo n _1 E [A^s >n ] > 0. They also demonstrate 
part (ii) in the statement of the lemma. 

Take 5 > with 25 < lim inf n _1 E [N s , n ]. We shall show that P[N s , n < 5n] 
decays exponentially in n, using Lemma 8.2. The variable Ns, n is a function of n 
independent identically distributed triples (marked points) (X k ,T k ,W k ). 

Consider the effect of changing the value of one of the marked points {{X, T, W) 
to (X',T',W), say). The change could affect the value of Is,n,j f° r & t most two 
values of j, namely the j with X G Bp >n j and the f with X' G Bp ;7l j/. So by 
Lemma 8.2, 

P[\N s , n -EN s , n \ > 5n] < 2exp(-5 2 n/8), 
and (10.12) follows. □ 

Proof of Theorem 5.1 under condition (i) (finite range interactions). Recall that 
h(H) is given by (5.4). Since condition (i) includes the assumption that hH(z*) < °o 
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for some n, by Lemma 10.2 we have h(H) < oo. Let b > with h(H)\b. Let 

e G (0,6). Let fi G N, and K > 0, be as given by Lemma 10.2. Then h^x = 

if h > 0, or fy^x < s ii h — 0. Moreover H(U* K ) is integrable by assumption. Set 



b 



if h^K > 
if h^ K = 0. 



10.13) 



Choose r G (0, oo) such that (5.2) holds. We shall apply Lemma 10.3 with 
(5 = K + r. Let S be the set of configurations of /i points in B(K + r) such that 
in fact all of the points are in B(K). By Lemma 10.3, we can find 5 > such that, 
writing N n for N$ !n we have exponential decay of P[N n < 8n\. 

Let Vi, V 2 , . . . , be random variables distributed as independent copies of H(U* K ), 
independently of X*. Set 



min( \5n\ ,N n ) 

E 



(L<5nJ-Af n )+ 



Thus, is the the total contribution to H n (X*) from points in u™n(L<5nj,v n )^^ ^ ^ 

By Part (ii) of Lemma 10.3, given that iV n > 5n, for each £ we know that 
r~ 1 (—xp yn) j(s,n,e) + X*)r\B*(K + T) is conditionally distributed as U* K+T conditional 
on U* K+T G S; in other words, distributed as W*^. Therefore the distribution of 
S n is that of the sum of [Sn\ independent copies of H(U* K ), independent of the 
contribution of the other points. Let Y n denote the contribution of the other points, 
i.e. 

Y n := H n (X*) - S' n . 

Since the distribution of S n , given the value of Y n , does not depend on the value of 
Y n , S n is independent of Y n . 

By assumption H n (X*) and S n are integrable. Clearly n 1 / 2 P[iJ n (A' n ) ^ Y n + S n ] 
is at most n 1 / 2 P[A^" n < Sn], which tends to zero by (10.12). Also by conditioning on 
N n , we have that 



n 



^\\H n {X: i )-{Y n -rS n ) 



n 



([Sn]-N n y 

E 



< n~ 1/2 E [( [5n\ - N n ) + ]E [|Vi|] 
[6n\P[N n < 6n]E [W] 



< nr 1/2 \5n\P\N n < 5n]E \\ViW , (10.14) 

which tends to zero by (10.12). This also shows that Y n is integrable By the as- 
sumption (5.5), 

L<H" 1/2 (#n«) -M n «)) A^o.rV), (lo.is) 
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and so, since h^x\bi, Theorem 2.1 is applicable, and yields 

sup { | (5n)^P[H n (X:) e[u,u + bl )] - 6V**-%<f> ( ( ^~^™ /2 ) | } 0, 

(10.16) 

and dividing through by <5 1//2 gives (5.6) in all cases where b — b±. In general, suppose 
b ^ 6i. Then = (else ^ = /i(if) and so b = 6i by (10.13)), and 

hence < e. Since b\ < b by (10.13), we have that 

inf [n^P[H n (X>) € [u, u + 6)] - ( ^fg^ ) } 

> inf (n^PI^t^*) 6 [«, u + h)\- a-\4> ( "~ E "" m ) } 
ueR [ y n i a J ) 

+^ 1 (6 1 -6)(2tt)- 1 / 2 

so that by (10.16), since &i > 6 — e, 

hminf inf (n^P^^) e[u,u + b)] - a'^ ( u ~ EH ^ X n) \ 1 > _£ (27r) -i/2. 
Similarly, setting 6 2 := h^xlb/h^x], we have that 

sup (n^P^K*) G [«,« + b)] - a-^ f ""^*"^ 
«eK L V 71/ (7 / 

< inf jn^P^TO G [it, it + fc 2 )l + r^M ~ 

+<T -i( 62 _ 6 )(2 7r )- 1 /2 

so that since b 2 — b < e, 

hmsupsupln^P^K*) G [u, t* + 6)] - ~ E ) j < i(27r)" 1 / 2 . 

Since e > is arbitrarily small, this gives us (5.6). □ 

Proof of Theorem 5.1 under condition (ii). We now assume that H, instead 
of having finite range, is given by (5.3) with £ depending only on the k nearest 
neighbours. Again, by Lemma 10.2 we have that h(H), given by (5.4), is finite. 

Let b > with h(H)\b. Let e G (0,6). Let \x G N and K > 0, with fi > k+ 1, by 
as given by Lemma 10.2. Then h^x = h(H) if h{H) > 0, and h^x < £ if fr(-ff) = 0. 
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Also, H(U* K ) integrable, by the integrability assumption in the statement of the 
result being proved. 

Let B±, B2, ■ ■ ■ , B v be a minimal collection of open balls of radius K, each of them 
centred at a point on the boundary of B(AK), such that their union contains the 
boundary of B(AK). Let B be the ball B(K). 

We shall apply Lemma 10.3 with j3 = 5K, with m = [y + and with S as 
follows. S is the set of configurations of m — {y + points in B(f3) = B(5K), 
such that each of £>i, . . . , B v contains at least \x points, and \J" =1 Bi contains exactly 
ujj, points, and also the ball £>o contains exactly /i points (so that consequently 
there are no points in B(5K) \ U^ =0 £>i). A similar construction (using squares rather 
than balls, and with diagram) was given by Avram and Bertsimas [1] for a related 
problem. 

With this choice of j3 and S, let the locations xp tn j = x$K,n,j, the balls Bp tJl j = 
B^K,n,ji the indicators Is,n,j, and the variables Ns, n an d J(S,n,£) be as described 
just before Lemma 10.3. By that result, we can (and do) choose 5 > such that 
(10.12) holds. 

For 1 < £ < N Sjn , the point process r-\-x hKiniJ { S , n j) + (X n nB 5Ktni j^ s ,n/))) has 
\x points within distance K of the origin, and also at least fi points in each of the 
balls Bi, ... , B v . 

Since fx> k + 1, for any point configuration in S, each point inside B(K) has its 
k nearest neighbours also inside B(K). Also none of the points in B(5K)\B(K) has 
any of its k nearest neighbours in B(K). Finally, any further added point outside 
B(5K) cannot have any of its k nearest neighbours inside B(K), since the line 
segment from such a point to any point in B(K) passes through the boundary of 
B(AK) at a location inside some Bi, and any of the \i or more points inside £>« are 
closer to the outside point than the point in B(K) is. To summarise this discussion, 
the points in B(K) are shielded from those outside B(5K). 

Given n, let 5K , . . . , W/,^? 5K be a collection of (marked) point pro- 

cesses which are each distributed as U^ u+1 ^ 5K conditioned on 5K G S, in- 

dependently of each other and of X*. For 1 < j < \_5n\ set Vj := H(w9J +1 ^ 5K H 
B*(K)), so that Vi,V 2 , . . . V^ n \ are random variables distributed as independent 
copies of H(U* K ), independent of X n . Define S' n and S n by 

min(L5nJ,AT 5j „) (lSn}~N s , n )+ 

S' n --= E H n (X:nB*(x 5K , n , J{s , n>e) ,Kr n )); S n :=S' n + ^ V r 
e=i 3=1 

Also set Y n := H n (X*) - S' n . 

Thus S' n is the total contribution to H n (X*) from points in B*(x§K,n,j(s,n,t\] Kr n ), 
1 < £ < min( [5n\ , Ns >n ). On account of the shielding effect described above, S n 
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is the sum of [5n\ independent copies of a random variable with the distribution 
of H{U* K ). Moreover, we assert that the distribution of S n , given the value of Y n , 
does not depend on the value of Y n , and therefore S n is independent of Y n . 

Essentially, this assertion holds because for any triple of sub- a- algebras Ti, T 2 , T3, 
if T\ V Ti is independent of T3 and T\ is independent of T 2 then T\ is independent 
of Ti V T 3 (here Ti V Tj is the smallest a-algebra containing both Ti and Tj). In 
the present instance, to define these a-algebras we first define the marked point 
processes 3^- for 1 < j < [5n\ by 



y.i 



r n \-x 5K ,n y j{s,n, j ) + {X* n B* bK n J{S n j) )) if 1 < j < min( [5n\ , N s , n ) 
\Kl^K XN Stn <j<\.5n\. 

Take T 3 to be the a-algebra generated by the values of J(<S, n, 1), . . . , 
J(S, n, min( [5n\ , Ns >n )) and the locations and marks of points of X n outside the 
union of the balls B 5K:n j iSjnA) , B^ K ^ J{S)nMn ^ Sn ^ Nsn)) . Take T 2 to be the a- 
algebra generated by the point processes D B*(5K) \ B*(K), 1 < j ' < \_5n\ . Take 
T\ to be the a- algebra generated by the point processes fl B*(K), 1 < j < [Sn\. 
Then by Lemma 10.3 and the definition of S, T\ V Ti is independent of T3 and T\ is 
independent of T 2 , so T\ is independent of T 2 \l T 3 . The variable S'n is measurable 
with respect to T±, and by shielding, the variable Y n is measurable with respect to 
T2 V T3, justifying our assertion of independence. 

By the assumptions of the result being proved, H n (X*) and S n are integrable. 
Clearly n 1/2 P[H n (X*) ^Y n + S n ] is at most n 1/2 P[N s ^ n < 5n], which tends to zero. 
Also, as with (10.14) in Case (i), we have that n _1 / 2 E [\H n (X*) - (Y n + S n )\] tends 
to zero by (10.12), and Y n is integrable. By (5.5), 

[5n\- l ' 2 (H n (X* n ) -EH n (X:)) A AT(0,rV 2 ), (10.17) 
and so, since /i^l&i, Theorem 2.1 is applicable with Z n = H n (X*), yielding 

s {\wp mx:) 6 K „ +i>i)i- ^o-'w ( ;;^-™ ) I } - 0, 

asn-} 00. Multiplying through by b~ x l 2 yields (5.6) for this case, when h\ = b. If 
b\ 7^ b, we can complete the proof in the same manner as in the proof for Case (i). 
□ 



11 Proof of Theorems 5.2, 5.3 and 5.4 

The proofs of Theorems 5.2, 5.3 and 5.4 all rely heavily on Theorem 2.3 of [20] so 
for convenience we state that result here in the form we shall use it. This requires 
some further notation, besides the notation we set up earlier in Section 5. 
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As before, we assume £(x, X*) is a translation invariant, measurable R-valued 
function defined for all pairs (x, X*), where X* C M. d x M. is finite and x is an element 
of X*. We extend the definition of £(x, X*) to the case where X* C R d x M and 
x G (W 1 x M) \ X*, by setting £(x, #*) to be f (x, X* U {x}) in this case. Recall 
that is defined by (5.3). 

Let T be an M. -valued random variable with distribution P^, independent of 
everything else. For A > let M\ be a Poisson variable with parameter A, indepen- 
dent of everything else, and let V\ be the point process {Xi, . . . ,Xm x }, which is a 
Poisson point process with intensity Xf(-). Let V\ := {(Xi, 7\), . . . , (Xm x ,Tm x )} be 
the corresponding marked Poisson process. 

Given A > 0, we say £ is A-homoegeneously stabilizing if there is an almost surely 
finite positive random variable R such that with probability 1, 

£((0, T); (H* x H B*(0; R)) U y) = £((0, T);T-L* x f] B*(0; R)) 

for all finite y C (M d \-B(0; i?)) x M. Recall that supp(/) denotes the support of /. 
We say that £ is exponentially stabilizing if for A > 1 and x G supp(/) there exists 
a random variable R Xt \ such that 

m 1/d x,n * 1/d (n n a- 1 /^,)) □ 

= £((A 1/d :r, T); A 1 '^ H 5*(:r; A" 1 ^))) 

for all finite y C (M d \ B(x;\~ l l d R Xy \))) x .M, and there exists a finite positive 
constant C such that 

P[^,a > s] < Cexp(-C- 1 S ), s > 1, A > 1, / G supp(/). 

For G N U {0}, let T k be the collection of all subsets of supp(/) with at most k 
elements. For k > 1 and ^4 = {x\, . . . , Xk} G 7fc \ 7^—1, let A* be the corresponding 
marked point set {(xi, Ti), . . . , (x k , T k )} where T 1; . . . , T k are independent .M-valued 
variables with distribution P^, independent of everything else. If A G 7o (so A = 0) 
let .A* also be the empty set. 

We say that £ is binomially exponentially stabilizing if there exist finite positive 
constants C, e such that for all x G supp(/) and all A > 1 and neNn((l- e)A, (1 + 
e)A)), and ^4 G 72, there is a random variable i? Xj A,n,^ such that 

^(A^T); A 1 / d ((^; u a*) n i?*(x; a-v^ iW )) u y) 

= e((A 1/d x, T); A 1 '^** U X) n 5*(:r; A-^w))) (11.1) 

for all finite 3/c(K d \ B(x; X~ 1/d R x ,x, n ,A)) x -H and such that all A > 1 and all 
n G N n ((1 - e)X, (1 + e)X)), and all x G supp(/) and all A G 72, 

P[^,A,n^ > s] < Cexp(-C- 1 s), s > 1. 
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Given p > and e > 0, we consider the moments conditions 

snp E[|e((A 1 /^,T);A 1 / d (^UX))H < oo (11.2) 

\>i,x£supp(f) ,AeSi 

and 

snp E[\a(X 1/d x,T);X 1 / d (X:uA*))\ p ]<oo. (11.3) 

A>l,n6Nn((l-e)A,(l+e)A),x€supp(/),^65 3 

Theorem 11.1. Suppose H = if ^ is induced by translation-invariant £. Suppose 
that £ is / (x) -homogeneously stabilizing for Lebesgue-almost all x G supp(/) 7 and £ 
zs exponentially stabilizing, binomially exponentially stabilizing and for some e > 
and p > 2 satisfies (11.2) and (11.3). Suppose / max < oo and supp(/) bounded. 
Suppose (X(n),n > 1) is a sequence taking values in R + with \X(n) — n\ — Ofa 1 / 2 ) 
as n oo. Then there exists a > sfic/i £/ierf 

n _1/2 (if ( ^(A(n) 1/d A'*) -EH^(X(n) 1/d X*)) J\f{fd,a 2 ), 

and n- l Vdx{Htt\X{n) l / d X*) — > a 2 as n — >■ oo. 

Theorem 11.1 is a special case of Theorem 2.3 of [20], which also provides an 
expression for a in terms of integrated two-point correlations; that paper considers 
random measures given by a sum of contributions from each point, whereas here we 
just consider the total measure. The sets Vl^ and (for all A > 1) Q\ in [20] are taken 
to be supp(/). Our £ is translation invariant, and these assumptions lead to some 
simplification of the notation in [20]. 

Proof of Theorem 5.2. The condition that £(x; A"*) has finite range implies 
that H = has finite range interactions. Since £ has finite range r, £ is A- 
homogeneously stabilizing for all A > 0, exponentially stabilizing and binomially 
exponentially stabilizing (just take R = r, R x x = r and R x ,x, n ,A = r )- 

We shall establish (5.5) by applying Theorem 11.1. We need to check the mo- 
ments conditions (11.2) and (11.3) in the present setting. Since we assume that 
/max < oo, for any A > and any n G N with n < 2A, and any x G supp(/), the 
variable card(A"* D B*(x; rX^ 1 ^)) is binomially distributed with with mean at most 
<^d/max2r d . Hence by Lemma 8.1, there is a constant C, such that whenever n < 2A 
and x G supp(/) we have 

P[card« f]B*(x;rX- 1/d )) > u] < Cexp(-u/C), u > 1. (11.4) 

Moreover by (5.11) and the assumption that £ has range r, for A G T3 we have 

E [£((X 1/d x, T); X l/d {X* U „4*)) 4 ] < 7 4 E [(4 + card« fl B*(x; rA~ 1/d ))) 47 ] 
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so by (11.4) we can bound the fourth moments of £((X 1/d x, T); X l/d (X* U A*)) uni- 
formly over (x, X,n, A) G supp(/) x [l,oo) x N x T3 with n < 2A. This gives us 
(11.3) (for p = 4 and e = 1/2) and (11.2) may be deduced similarly. 

Hence, the assumptions of Theorem 11.1 are satisfied, with X(n) in that result 
given by X(n) = r~ d . By Theorem 11.1, for some a > we have (5.5) and (5.9). 
Then by Theorem 5.1, we can deduce that a > and h(H) < 00 and (5.6) holds 
whenever h(H)\b. □ 

Proof of Theorem 5.3. Under condition (5.2), the functional H(X*) can be 
expressed as a sum of contributions from components of the geometric (Gilbert) 
graph Q(X,t), where X := ir(X*) is the unmarked point set corresponding to X* 
(recall that n denotes the canonical projection from M. d x Ai onto M. d .) Hence, 
H(X*) can be written as H®(X*) where £(x; X*) denotes the contribution to H(X*) 
from the component containing 7r(x), divided by the number of vertices in that 
component. Then £(x; A"*) is unaffected by changes to X* that do not affect the 
component of Q(X,t) containing vr(x), and we shall use this to demonstrate that 
the conditions of Theorem 11.1 hold, as follows (the argument is similar to that in 
Section 11.1 of [18]). 

Consider first the homogeneous stabilization condition. For A > 0, let -R(A) be 
the maximum Euclidean distance from the origin of vertices in the graph G(H\ U 
{0},t) that are pathwise connected to the origin, which by scaling (see the Map- 
ping theorem in [15]) has the same distribution as r times the maximum Euclidean 
distance from the origin of vertices in G{7-L T dx U {0}, 1), that are pathwise connected 
to the origin. Then R{\) is almost surely finite, for any A G (0,r~ d A c ). 

Changes to H\ at a distance more than R(X) + r from the origin do not affect 
the component of G(H\ U {0},r) containing the origin and therefore do not affect 
£((0, T); H* x ). This shows that £ is A- homogeneously stabilizating for any A < r~ d \ c , 
and therefore by assumption (5.12) the homogeneous stabilization condition of The- 
orem 11.1 holds. 

Next we consider the binomial stabilization condition. Let x G supp(/). Let 
Rx,x,n be equal to r plus the maximum Euclidean distance from A^rr of ver- 
tices in Q(\ 1 ^ d (X n U {x}),t) that are pathwise connected to X 1 ^ d x. Changes to 
X n at a Euclidean distance greater than X~ l l d R x \ n from x will have no effect on 
£((X 1/d x,T); X 1/d X*). 

Using (5.12), let e G (0,1/2) with (1 + e) 2 T d f max < X c . The Poisson point 
process V n{1+£) := {X ± , . . . , I M „ (1+£) }, is stochastically dominated by "H„/ max (i+ e ) 
(we say a point process X is stochastically dominated by a point process y if there 
exist coupled point processes X', y' with X' C y' almost surely and X' having the 
distribution of X and y' having the distribution of y). Hence by scaling, X l l d V n (\ +£ ) 
is stochastically dominated by "Hn/ max (i+ £ )/A, and hence we have for n < A(l + e) 
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that \ l l d V n {i+s) is stochastically dominated by %/ max (i+ £ )2. Therefore for u > 0, 

P[R X ,X,n >U}< P[M n(l+e) < U] + P[i?((l + £ ) 2 /max) > u - r]. (11.5) 

By scaling, the second probability in (11.5) equals the probability that there is a path 
from the origin in ^("H T d (i+£) 2 / max U 1) ^° a point at Euclidean distance greater 
than r _1 M — 1 from the origin. By the exponential decay for subcritical continuum 
percolation, (see e.g. Lemma 10.2 of [18]), this probability decays exponentially in 
u (and does not depend on n). 

Let A := diam(supp(/)) (here assumed finite). By Lemma 8.1, the first term 
in the right hand side of (11.5) decays exponentially in n. Hence, there is a finite 
positive constant C, independent of A, such that provided we have n > (1 — e)X 1 ^ d 
we have for all u < A 1/(i (A + r) that 

P[M n( i +e) < n] < Cexp(-C- 1 A 1/d ) < Cexp(-((A + r)CTM- 

On the other hand P[R xX n > u] = for u > A 1 / d (A + r). Combined with (11.5) this 
shows that there is a constant C such that for all (x, n, A, u) G supp(/) x N x [1, , oo) 2 
with n < (1 + e)X, we have 

P[R x ,x,n >u}< Cexp(-u/C). (11.6) 

Now suppose A G T3, and x G supp(/). Let R x ,x,n,A be equal to r plus the maxi- 
mum Euclidean distance from \ l l d x of vertices in Q(X 1 ^ d (X n U ^4U {x}); r) that are 
pathwise connected to X^^x. Changes to X n U A at a Euclidean distance greater 
than A-^i^A,^ from x will have no effect on £((X 1/d x,T); X 1 / d {X* U ^*)); that 
is, (11.1) holds. To check the tail behaviour of R x ,x,n,A, suppose for example that A 
has three elements, xi, x 2 and x 3 . Then it is not hard to see that 

R X ,X,n,A — Rx,X,n Rxi,X,n Rx2,X,n Rx3,X,m 

and likewise when A has fewer than three elements. Using this together with (11.6), 
it is easy to deduce that there is a constant C such that for all (x, n, A, A, u) G 
supp(/) x N x %, x [1, oo) 2 with n < (1 + e)X, and we have 

P[R x ,x,n,A >u}< Cexp(-«/C). (11.7) 

In other words, £ is binomially exponentially stabilizing. 

Next we check the moments condition (11.3), with p = 4 and using the same 
choice of e as before. By our definition of £ and the growth bound (5.13), we have 
for all (x, n, A, A) G supp(/) x N x % x [1, oo) 2 with n < A(l + e) that 

E MX^x, T); A 1 / d (Ar n * U ^*)) 4 ] < 7 4 E [(card(C) + diam(C)) 47 ], (11.8) 
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where C is the vertex set of the component of Q(\ l l d (X n U A U {x}); t) containing 
X 1 l d x. By (11.7), there is a constant C such that for all (x,n,A,X,u) G (supp(/) x 
N x % x [1, oo) 2 with n < A(l + e) we have 

P[diam(C) > u] < Cexp(-u/C); (11.9) 

moreover, 

P[card(C) > u] < P[diam(C) > w 1 /^] + P[card(A' n n B(x; X~ l/d u 1/{2d) )) > u - 4] 

(11.10) 

and the first term in the right hand side of (11.10) decays exponentially in u 1 ^ 2 ^ 
by (11.9). Since card(^ n n B(x; A _1/ V /(M) )) is binomially distributed with 

E [card(^ n n B(x; \~ 1/d u 1/( - 2d) ))] < u l ' 2 uj d f m ^n/\ 

by Lemma 8.1 there is a constant C such that for all (x, n, X,u) with n < A(l + e) 
we have that 

P[card(A' n n B(x; X'^u 1 /^)) > u - 4] < C exp(-C'- 1 M 1 / 2 ). 

Thus by (11.10) there is a constant, also denoted C, such that for all (x, n, A, X,u) 
with n < A(l + e) we have 

P[card(C) > u] < Ce^{-C- l u- l/{2d) ), 

and combining this with (11.9) and using (11.8) gives us a uniform tail bound which 
is enough to ensure (11.3). The argument for (11.2) is similar. 

Thus our £ satisfies all the assumptions of Theorem 11.1, and we can deduce 
(5.5) and (5.9) for some a > by applying that result with A(n) = r~ d . Then by 
applying Theorem 5.1, we can deduce that a > and h(H) < oo and (5.6) holds 
whenever h(H)\b. □ 

Proof of Theorem 5.4. Suppose the hypotheses of Theorem 5.4 hold, and 
assume without loss of generality that £(x, X*) = whenever X* \ {x} has fewer 
than k elements. We assert that under these hypotheses, there exists a constant C 
such that for all (x, n, A, u) G supp(/) x N x [1, oo) 2 with n G [A/2, 3A/2] and n > k, 
we have 

P[X 1/d R K {{x,T);X*) >u]< Cexpi-C^u). (11.11) 

Indeed, if supp(/) is a compact convex region in M. d and / is bounded away from 
zero on supp(/), then (11.11) is demonstrated in Section 6.3 of [20], while if supp(/) 
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is a compact d-dimensional sub manifold- with-boundary of ~R d , and / is bounded 
away from zero on supp(/), then (11.11) comes from the proof of Lemma 6.1 of [24]. 

It is easy to see that £ is A-homogeneously stabilizing for all A > 0. Also, for 
any (x,A) G (supp(/) x 75) we obviously have R K ((x,T); X* UA*) < R K ((x, T); X*) 
and hence by (11.11), £ is binomially exponentially stabilizing, and exponential 
stabilization comes from a similar estimate with a Poisson sample. 

We need to check the moments conditions to be able to deduce (5.5) via Theorem 
11.1. With 7 as in the growth bound (5.14), we claim that there is a constant C such 
that for any A E %, any x G supp(/), and any u > 0, and for all (x,n, A, X,u) G 
supp(/) x N x % x [1, oo) 2 with A/2 <n< 3A/2, and n > k, we have 

pum^n \v\x* u a*))\ > u\ < p[ 7 (i + x^r^t)^:))-* > u \ 

< Cexp(-C~V /7 ).(11.12) 

Indeed, the first bound comes from the (5.14), and the second bound comes from 
(11.11). Using (11.12), we can deduce the moments bound (11.3) for p = 4 and 
e = 1/2. We can derive (11.2) similarly. Thus Theorem 11.1 is applicable, and 
enables us to deduce (5.5) and (5.9) for some a > 0, in the present setting. Then 
by using Theorem 5.1, we can deduce that a > and h(H) > and (5.6) holds 
whenever h(H)\b. □ 
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